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Abstract 

Represented as graphs, real networks are intricate combinations of or¬ 
der and disorder. Fixing some of the structural properties of network 
models to their values observed in real networks, many other properties 
appear as statistical consequences of these fixed observables, plus ran¬ 
domness in other respects. Here we employ the dk- series, a complete set 
of basic characteristics of the network structure, to study the statistical 
dependencies between different network properties. We consider six real 
networks—the Internet, US airport network, human protein interactions, 
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technosocial web of trust, English word network, and an fMRI map of 
the human brain—and find that many important local and global struc¬ 
tural properties of these networks are closely reproduced by dfe-random 
graphs whose degree distributions, degree correlations, and clustering are 
as in the corresponding real network. We discuss important conceptual, 
methodological, and practical implications of this evaluation of network 
randomness, and release software to generate dfc-random graphs. 


Introduction 

Network science studies complex systems by representing them as networks PQ. 
This approach has proven quite fruitful because in many cases the network 
representation achieves a practically useful balance between simplicity and real¬ 
ism: while always grand simplifications of real systems, networks often encode 
some crucial information about the system. Represented as a network, the sys¬ 
tem structure is fully specified by the network adjacency matrix, or the list of 
connections, perhaps enriched with some additional attributes. This (possibly 
weighted) matrix is then a starting point of research in network science. 

One significant line of this research studies various (statistical) properties of 
adjacency matrices of real networks. The focus is often on properties that convey 
useful information about the global network structure that affects the dynamical 
processes in the system that this network represents |2|. A common belief is 
that a self-organizing system should evolve to a network structure that makes 
these dynamical processes, or network functions, efficient |Mj. If this is the 
case, then given a real network, we may “reverse engineer” it by showing that its 
structure optimizes its function. In that respect the problem of interdependency 
between different properties becomes particularly important EHID3- 

Indeed, suppose that the structure of some real network has property X — 
some statistically over- or under-represented subgraph, or motif m , for example— 
that we believe is related to a particular network function. Suppose also that the 
same network has in addition property Y —some specific degree distribution or 
clustering, for example—and that all networks that have property Y necessar¬ 
ily have property X as a consequence. Property Y thus enforces or “explains” 
property X, and attempts to “explain” X by itself, ignoring Y. are misguided. 
For example, if a network has high density (property Y), such as the interarial 
cortical network in the primate brain where 66% of edges that could exist do 
exist m, then it will necessarily have short path lengths and high clustering, 
meaning it is a small-world network (properties X). However, unlike social net¬ 
works where the small-world property is an independent feature of the network, 
in the brain this property is a simple consequence of high density. 

The problem of interdependencies among network properties has been long 
understood mm.. The standard way to address it, is to generate many graphs 
that have property Y and that are random in all other respects—let us call them 
Y-random graphs —and then to check if property X is a typical property of these 
F-random graphs. In other words, this procedure checks if graphs that are 
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sampled uniformly at random from the set of all graphs that have property Y. 
also have property A with high probability. For example, if graphs are sampled 
from the set of graphs with high enough edge density, then all sampled graphs 
will be small worlds. If this is the case, then X is not an interesting property 
of the considered network, because the fact that the network has property X is 
a statistical consequence of that it also has property Y. In this case we should 
attempt to explain Y rather than A'. In case X is not a typical property of 
Y-random graphs, one cannot really conclude that property X is interesting or 
important (for some network functions). The only conclusion one can make is 
that Y cannot explain A, which does not mean however that there is no other 
property Z from which X follows. 

In view of this inherent and unavoidable relativism with respect to a null 
model, the problem of structure-function relationship requires an answer to 
the following question in the first place: what is the right base property or 
properties Y in the null model (Y-random graphs) that we should choose to 
study the (statistical) significance of a given property A in a given network [15) ? 
For most properties A including motifs d, the choice of Y is often just the 
degree distribution. That is, one usually checks if A is present in random 
graphs with the same degree distribution as in the real network. Given that 
scale-free degree distributions are indeed the striking and important features of 
many real networks JT], this null model choice seems natural, but there are no 
rigorous and successful attempts to justify it. The reason is simple: the choice 
cannot be rigorously justified because there is nothing special about the degree 
distribution—it is one of infinitely many ways to specify a null model. 

Since there exists no unique preferred null model, we have to consider a 
series of null models satisfying certain requirements. Here we consider a par¬ 
ticular realization of such series—the dk -series [16] , which provides a complete 
systematic basis for network structure analysis, bearing some conceptual simi¬ 
larities with a Fourier or Taylor series in mathematical analysis. The dfc-series is 
a converging series of basic interdependent degree- and subgraph-based proper¬ 
ties that characterize the local network structure at an increasing level of detail, 
and define a corresponding series of null models or random graph ensembles. 
These random graphs have the same distribution of differently sized subgraphs 
as in a given real network. Importantly, the nodes in these subgraphs are la¬ 
beled by node degrees in the real network. Therefore this random graph series 
is a natural generalization of random graphs with fixed average degree, degree 
distribution, degree correlations, clustering, and so on. Using dfc-series we ana¬ 
lyze six real networks, and find that they are essentially random as soon as we 
constrain their degree distributions, correlations, and clustering to the values 
observed in the real network (Y=degrees+correlations-(-clustering). In other 
words, these basic local structural characteristics almost fully define not only 
local but also global organization of the considered networks. These findings 
have important implications on research dealing with network structure-function 
interplay in different disciplines where networks are used to represent complex 
natural or designed systems. We also find that some properties of some net¬ 
works cannot be explained by just degrees, correlations, and clustering. The 
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dfc-series methodology thus allows one to detect which particular property in 
which particular network is non-trivial, cannot be reduced to basic local degree- 
or subgraph-based characteristics, and may thus be potentially related to some 
network function. 


Results 

The introductory remarks above instruct one to look not for a single base prop¬ 
erty Y, which cannot be unique or universal, but for a systematic series of base 
properties Yq, Yi, .... By “systematic” we mean the following conditions: 1) in¬ 
clusiveness, that is, the properties in the series should provide strictly more 
detailed information about the network structure, which is equivalent to requir¬ 
ing that networks that have property Yd (Y^-random graphs), d > 0, should also 
have properties Yd' for all d! = 0,1,..., d — 1; and 2) convergence , that is, there 
should exist property Y n in the series that fully characterizes the adjacency 
matrix of any given network, which is equivalent to requiring that Y/j-random 
graphs is only one graph—the given network itself. If these Y-series satisfy the 
conditions above, then whatever property X is deemed important now or later 
in whatever real network, we can always standardize the problem of explanation 
of X by reformulating it as the following question: what is the minimal value 
of d in the above Y-series such that property Yd explains XI By convergence, 
such d should exist; and by inclusiveness, networks that have property Y S! with 
any d! = d, d+ 1,..., D, also have property X. Assuming that properties Yd are 
once explained, the described procedure provides an explanation of any other 
property of interest X. 

The general philosophy outlined above is applicable to undirected and di¬ 
rected networks, and it is shared by different approaches, including motifs m, 
graphlets El, and similar constructions HU, albeit they violate the inclusive- 
ness condition as we show below. Yet one can still define many different Y-series 
satisfying both conditions above. Some further criteria are needed to focus on 
a particular one. One approach is to use degree-based tailored random graphs 
as null models for both undirected riWTI and directed [221 122] networks. The 
criteria that we use to select a particular Y-series in this study are simplic¬ 
ity and the importance of subgraph- and degree-based statistics in networks. 
Indeed, in the network representation of a system, subgraphs, their frequency 
and convergence are the most natural and basic building blocks of the system, 
among other things forming the basis of the rigorous theory of graph family 
limits known as graphons [23] , while the degree is the most natural and basic 
property of individual nodes in the network. Combining the subgraph- and 
degree-based characteristics leads to dk-series pjj. 

dk- series 

In dfc-series, properties Yd are dk-distributions. For any given network G of 
size N, its dfc-distribution is defined as a collection of distributions of G’s sub- 
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graphs of size d = 0,1,..., N in which nodes are labeled by their degrees in G. 
That is, two isomorphic subgraphs of G involving nodes of different degrees—for 
instance, edges (d = 2) connecting nodes of degrees 1,2 and 2,2—are counted 
separately. The Ok- “distribution” is defined as the average degree of G. Figure]!] 
illustrates the dfc-distributions of a graph of size 4. 

Thus defined the dk -series subsumes all the basic degree-based character¬ 
istics of networks of increasing detail. The zeroth element in the series, the 
Ok- “distribution,” is the coarsest characteristic, the average degree. The next 
element, the lfc-distribution, is the standard degree distribution, which is the 
number of nodes—subgraphs of size 1—of degree k in the network. The sec¬ 
ond element, the 2fc-distribution, is the joint degree distribution, the number 
of subgraphs of size 2—edges—between nodes of degrees fcq and k 2 - The 2k- 
distribution thus defines 2-node degree correlations and network’s assortativity. 
For d = 3, the two non-isomorphic subgraphs are triangles and wedges, com¬ 
posed of nodes of degrees fcq, k 2 , and fc 3 , which defines clustering, and so on. 
For arbitrary d the dfc-distribution characterizes the ‘d’egree ‘/correlations in d- 
sized subgraphs, thus including, on the one hand, the correlations of degrees of 
nodes located at hop distances below d, and, on the other hand, the statistics of 
d-cliques in G. We will also consider d/c-distributions with fractional d € (2,3) 
which in addition to specifying 2-node degree correlations (d = 2), fix some 
d = 3 substatistics related to clustering. 

The dk- series is inclusive because the (d+ l)fc-distribution contains the same 
information about the network as the dfc-distribution, plus some additional 
information. In the simplest d = 0 case for example, the degree distribu¬ 
tion P(k) (1/c-distribution) defines the average degree fc (O/c-distribution) via 
k = kP(k). The analogous expression for d = 1,2 are derived in Supple¬ 
mentary Note 1. 

It is important to note that if we omit the degree information, and just 
count the number of d-sized subgraphs in a given network regardless their node 
degrees, as in motifs ca, graplilets m, or similar constructions [T8], then such 
degree-/c-agnostic d-series (versus dk- series) would not be inclusive (Supplemen¬ 
tary Discussion). Therefore preserving the node degree (‘/c’) information is nec¬ 
essary to make a subgraph-based (‘d’) series inclusive. The d/c-series is clearly 
convergent because at d = N where N is the network size, the IVfc-distribution 
fully specifies the network adjacency matrix. 

A sequence of dfc-distributions then defines a sequence of random graph 
ensembles (null models). The dk-graphs are a set of all graphs with a given 
dfc-distribution, for example, with the dfc-distribution in a given real network. 
The dk-random graphs are a maximum-entropy ensemble of these graphs ns- 
This ensemble consists of all dfc-graphs, and the probability measure is uni¬ 
form (unbiased): each graph G in the ensemble is assigned the same probability 
P(G) = 1/Afd, where Md is the number of dfc-graphs. For d = 0,1,2 these 
are well studied classical random graphs Gn,m [2S], configuration model H3- 
125] , and random graphs with a given joint degree distribution [29], respectively. 
Since a sequence of dfc-distributions is increasingly more informative and thus 
constraining, the corresponding sequence of the sizes of dfc-random graph en- 
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sembles is non-increasing and shrinking to 1, Wo > Wi > ... > Wjv = 1, Fig. [T] 
At low d = 0,1, 2 these numbers A f can be calculated either exactly or approx¬ 
imately [30([M]. 

We emphasize that in dfc-graphs the dfc-distribution constraints are sharp, 
that is, they hold exactly—all dk- graphs have exactly the same dfc-distribution. 
An alternative description uses soft maximum-entropy ensembles belonging to 
the general class of exponential random graph models in which these con¬ 

straints hold only on average over the ensemble—the expected dfc-distribution in 
the ensemble (not in any individual graph) is fixed to a given distribution. This 
ensemble consists of all possible graphs G of size N, and the probability mea¬ 
sure P{G) is the one maximizing the ensemble entropy S = — J2 G -P(G) In P(G) 
under the d/c-distribution constraints. Using analogy with statistical mechanics, 
sharp and soft ensemble are often called nricrocanonical and canonical, respec¬ 
tively. 

As a consequence of the convergence and inclusiveness properties of dk- 
series, any network property X of any given network G is guaranteed to be 
reproduced with any desired accuracy by high enough d. At d = N all possible 
properties are reproduced exactly, but the WA-graph ensemble trivially consists 
of only one graph, Gself, and has zero entropy. In the sense that the entropy 
of dfc-ensembles Sd = InWd is a non-increasing function of d , the smaller the d, 
the more random the dk- random graphs, which also agrees with the intuition 
that d/c-random graphs are “the less random and the more structured,” the 
higher the d. Therefore the general problem of explaining a given property X 
reduces to the general problem of how random a graph ensemble must be so that 
X is statistically significant. In the dlc-series context, this question becomes: 
how much local degree information, that is, information about concentrations 
of degree-labeled subgraphs of what minimal size d, is needed to reproduce a 
possibly global property X with a desired accuracy? 

Below we answer this question for a set of popular and commonly used 
structural properties of some paradigmatic real networks. But to answer this 
question for any property in any network, we have to be able to sample graphs 
uniformly at random from the sets of dk- graphs—the problem that we discuss 
next. 
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Figure 1: The dfc-series illustrated, (a) shows the dfc-distributions for a 
graph of size 4. The 4/c-distribution is the graph itself. The 3/c-distribution 
consists of its three subgraphs of size 3: one triangle connecting nodes of degrees 

2, 2, and 3, and two wedges connecting nodes of degrees 2, 3, and 1. The 2k- 
distribution is the joint degree distribution in the graph. It specifies the number 
of links (subgraphs of size 2) connecting nodes of different degrees: one link 
connects nodes of degrees 2 and 2, two links connect nodes of degrees 2 and 

3, and one link connects nodes of degree 3 and 1. The 1/c-distribution is the 
degree distribution in the graph. It lists the number of nodes (subgraphs of 
size 1) of different degree: one node of degree 1, two nodes of degree 2, and 
one node of degree 3. The Ofc-distribution is just the average degree in the 
graph, which is 2. (b) illustrates the inclusiveness and convergence of dfc-series 
by showing the hierarchy of dfc-graplis, which are graphs that have the same 
dfc-distribution as a given graph G of size n. The black circles schematically 
shows the sets of dfc-graphs. The set of Ofc-graphs, that is, graphs that have the 
same average degree as G, is largest. Graphs in this set may have a structure 
drastically different from G’s. The set of 1/c-graphs is a subset of Ofc-graphs, 
because each graph with the same degree distribution as in G has also the same 
average degree as G, but not vice versa. As a consequence, typical lfc-graphs, 
that is, lfc-random graphs, are more similar to G than Ofc-graphs. The set of 
2/c-graphs is a subset of lfc-graphs, also containing G. As d increases, the circles 
become smaller because the number of different dk -graphs decreases. Since all 
the dfc-graph sets contain G, the circles “zoom-in” on it, and while their number 
decreases, dfc-graphs become increasingly more similar to G. In the d = n limit, 
the set of nfc-graphs consists of only one element, G itself. 




























c/Zc-random graph sampling 

Soft dfc-ensembles tend to be more amenable for analytic treatment, compared 
to sharp ensembles, but even in soft ensembles the exact analytic expressions 
for expected values are known only for simplest network properties in simplest 
ensembles [S3137]. Therefore we retreat to numeric experiments here. Given 
a real network G, there exist two ways to sample dfc-random graphs in such 
experiments: d/c-randomize G generalizing the randomization algorithms in [38l 
[39] , or construct random graphs with G’s dfc-sequence from scratch [16], 40], also 
called direct construction [4TM4j . 

The first option, dfc-randomization, is easier. It accounts for swapping ran¬ 
dom (pairs of) edges, starting from G, such that the dfc-distribution is preserved 
at each swap, Fig. [2j There are many concerns with this prescription [45] . two of 
which are particularly important. The first concern is if this process “ergodic,” 
meaning that if any two dfc-graphs are connected by a chain of dfc-swaps. For 
d = 1 the 2-edge swap is ergodic [35} [39], while for d = 2 it is not ergodic. 
However the so-called restricted 2-edge swap, when at least one node attached 
to each edge has the same degree, Fig. [2] was proven to be ergodic [3B]. It 
is now commonly believed that there is no edge-swapping operation, of this or 
other type, that is ergodic for the 3fc-distribution, although a definite proof is 
lacking at the moment. If there exists no ergodic 3fc-swapping, then we cannot 
really rely on it in sampling dfc-random graphs because our real network G can 
be trapped on a small island of atypical dfc-graphs, which is not connected by 
any dfc-swap chain to the main land of many typical d/c-graphs. Yet we note 
that in an unpublished work [47] we showed that five out of six considered real 
networks were virtually indistinguishable from their 3A:-randomizations across 
all the considered network properties, although one network (power grid) was 
very different from its 3/c-random counterparts. 

The second concern with dfc-randomization is about how close to uniform 
sampling the dk -swap Markov chain is after its mixing time is reached- its 
mixing time is yet another concern that we do not discuss here, but according 
to many numerical experiments and some analytic estimates, it is O(M) [T91 [291 
I35H401 [45] . Even for d = 1 the swap chain does not sample lfc-graphs uniformly 
at random, yet if the edge-swap process is done correctly, then the sampling is 
uniform [20L 211] . 

A simple algorithm for the second dfc-sampling option, constructing dk- 
graphs from scratch, is widely known for d = 1: given G’s degree sequence 
{h}, build a lfc-random graph by attaching ki half-edges (“stubs”) to node 
i, and then connect random pairs of stubs to form edges m- If during this 
process a self-loop (both stubs are connected to the same node) or double-edge 
(two edges between the same pair of nodes) is formed, one has to restart the 
process from scratch since otherwise the graph sampling is not uniform [4Sl . If 
the degree sequence is power-law distributed with exponent close to —2 as in 
many real networks, then the probability that the process must be restarted 
approaches 1 for large graphs [49], so that this construction process never suc¬ 
ceeds. An alternative greedy algorithm is described in [42] . which always quickly 


succeeds and gives an efficient way of testing if a given sequence of integers is 
graphical, that is, if it can be realized as a degree sequence of a graph. The 
base sampling procedure does not sample graphs uniformly, but then an im¬ 
portance sampling procedure is used to account for the bias, which results in 
uniform sampling. Yet again, if the degree distribution is a power law, one can 
show that even without importance sampling, the base sampling procedure is 
uniform, since the distribution of sampling weights that one can compute for 
this greedy algorithm approaches a delta function. Extensions of the naive 1 Re¬ 
construction above to 2k are less known, but they exist [16j !!29, 04] [50] ■ Most 
of these 2fc-constructions do not sample 2fc-graphs exactly uniformly either 05] , 
but importance sampling [50] 02] can correct for the sampling biases. 

Unfortunately, to the best of our knowledge, there currently exists no 3Re¬ 
construction algorithm that can be successfully used in practice to generate 
large 3Re-graphs with 3Re-distributions of real networks. The 3Re-distribution is 
quite constraining and non-local, so that the cRRe-construction methods described 
above for d = 1,2 cannot be readily extended to d = 3 mi There is yet another 
option—3Rc-targeting rewiring, Fig. [2] It is 2Rc-preserving rewiring in which 
each 2fc-swap is accepted not with probability 1, but with probability equal 
to min(l, exp(— /3AH)), where /3 is the inverse temperature of this simulated- 
annealing-like process, and AH is the change in the L 1 distance between the 3Re¬ 
distribution in the current graph and the target 3Rc-distribution before and after 
the swap. This probability favors and, respectively, suppresses 2Rc-swaps that 
move the graph closer or farther from the target 3fc-distribution. Unfortunately 
we report that in agreement with [40] this 2Rc-preserving 3Rc-targeting process 
never converged for any considered real network —regardless of how long we 
let the rewiring code run, after the initial rapid decrease, the 3Rc-distance, while 
continuing to slowly decrease, remained substantially large. The reason why this 
process never converges is that the 3Rc-distribution is extremely constraining, so 
that the number of 3Rc-graphs A 3 is infinitesimally small compared to the number 
of 2Rc-graphs A 2 , A/ 3 /A /2 1 pElEO]. Therefore it is extremely difficult for the 

3Rc-targeting Markov chain to find a rare path to the target 3Rc-distribution, and 
the process gets hopelessly trapped in abundant local minima in distance H. 

Therefore, on the one hand, even though 3Rc-randomized versions of many 
real networks are indistinguishable from the original networks across many met¬ 
rics m, we cannot use this fact to claim that at d = 3 these metrics are not 
statistically significant in those networks, because the 3Rc-randomization Markov 
chain may be non-ergodic. On the other hand, we cannot generate the corre¬ 
sponding 3Rc-random graphs from scratch in a feasible amount of compute time. 
The 3Rc-random graph ensemble is not analytically tractable either. Given that 
d = 2 is not enough to guarantee the statistical insignificance of some impor¬ 
tant properties of some real networks, see [47] and below, we, as in [40], retreat 
to numeric investigations of 2Rc-random graphs in which in addition to the 2k- 
distribution, some substatistics of the 3Rc-distribution is fixed. Since strong 
clustering is a ubiquitous feature of many real networks f|, one of the most 
interesting such substatistics is clustering. 

Specifically we study 2.1Rc-random graphs, defined as 2Rc-random graphs with 
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a given value of average clustering c, and 2.5/c-random graphs, defined as 2k- 
randorn graphs with given values of average clustering c(fc) of nodes of degree 
k [40] . The 3fc-distribution fully defines both 2.1fc- and 2.5/c-statistics, while 
2.5 k defines 2.1fc. Therefore 2/c-graphs are a superset of 2.1fc-graphs, which are 
a superset of 2.5fc-graphs, which in turn contain all the 3fc-graphs, A > A/ 2.1 > 
A/ 2.5 > A 3 . Therefore if a particular property is not statistically significant 
in 2.5fc-random graphs, for example, then it is not statistically significant in 
3/c-random graphs either, while the converse is not generally true. 

We thus generate 20 dfc-random graphs with d = 0,1, 2,2.1, 2.5 for each 
considered real network. For d = 0,1,2 we use the standard dk -randomizing 
swapping, Fig. [2] We do not use its modifications to guarantee exactly uniform 
sampling HE EH], because: ( 1 ) even without these modifications the swapping 
is close to uniform in power-law graphs, ( 2 ) these modifications are non-trivial 
to efficiently implement, and (3) we could not extend these modifications to the 
2 . 1 fc and 2.5/c cases. As a consequence, our sampling is not exactly uniform, but 
we believe it is close to uniform for the reasons discussed above. To generate dk- 
random graphs with d = 2.1, 2.5, we start with a 2fc-random graph, and apply 
to it the standard 2fc-preserving 2.a:fc-targeting (x = 1,5) rewiring process, 
Fig. [2j The algorithms that do that, as described in HI) did not converge 
on some networks, so that we modified the algorithm in HDj to ensure the 
convergence in all cases. The details of these modifications are in Supplementary 
Methods (the parameters used are listed in Supplementary Table 4), along with 
the details of the software package implementing these algorithms that we release 
to public m- 
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Figure 2: The dfc-sampling and convergence of dfc-series illustrated. 

The left column shows the elementary swaps of dk -randomizing (for d = 0,1, 2) 
and cWc-targeting (for d = 2.1, 2.5) rewiring. The nodes are labeled by their 
degrees, and the arrows are labeled by the rewiring acceptance probability. In 
dfc-randomizing rewiring, random (pairs of) edges are rewired preserving the 
graph’s dfc-distribution (and consequently its d'A'-distributions for all d' < d). 
In 2.1 k- and 2.5fc-targeting rewiring, the moves preserve the 2/c-distribution, but 
each move is accepted with probability p designed to drives the graph closer to 
a target value of average clustering c (2.1fc) or degree-dependent clustering c(k) 
(2.5 k): p = mm(l,e~P AH ), where /3 the inverse temperature of this simulated 
annealing process, AH = H a — Hb, and H a t, are the distances, after and before 
the move, between the current and target values of clustering: H 2 .ik = |c C urrent~ 
Ctargetl and H 2 , 5k = |c C urrent[fcz]-Ctarget[fci]|. The right column shows LaNet- 
vi [531 visualizations of the results of these dfc-rewiring processes (Supplementary 
Methods), applied to the PGP network, visualized at the bottom of the left 
column. The node sizes are proportional to the logarithm of their degrees, 
while the color reflects node coreness [65]. As d grows, the shown d£;-random 
graphs quickly become more similar to the real PGP network. 
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Real versus <i/c-random networks 

We performed an extensive set of numeric experiments with six real networks— 
the US air transportation network, an fMRI map of the human brain, the Inter¬ 
net at the level of autonomous systems, a technosocial web of trust among users 
of the distributed Pretty Good Privacy (PGP) cryptosystem, a human protein 
interaction map, and an English word adjacency network (Supplementary Note 
2 and Supplementary Table 3 present the analysed networks). For each network 
we compute its average degree, degree distribution, degree correlations, aver¬ 
age clustering, averaging clustering of nodes of degree k. and based on these 
dfc-statistics generate a number of dfc-random graphs as described above for 
each d = 0,1, 2,2.1, 2.5. Then for each sample we compute a variety of network 
properties, and report their means and deviations for each combination of the 
real network, d , and the property. Figures [3][6] present the results for the PGP 
network; Supplementary Note 3, Supplementary Figures 1-10, and Supplemen¬ 
tary Tables 1-2 provide the complete set of results for all the considered real 
networks. The reason why we choose the PGP network as our main example 
is that this network appears to be “least random” among the considered real 
networks, in the sense that the PGP network requires higher values of d to 
reproduce its considered properties. The only exception is the brain network. 
Some of its properties are not reproduced even by d = 2.5. 

Figure [2] visualizes the PGP network and its d/c-randomizations. The figure 
illustrates the convergence of dfc-series applied to this network. While the 0k- 
random graph has very little in common with the real network, the lfc-random 
one is somewhat more similar, even more so for 2k, and there is very little visual 
difference between the real PGP network and its 2.5fc-random counterpart. This 
figure is only an illustration though, and to have a better understanding of how 
similar the network is to its randomization, we compare their properties. 

We split the properties that we compare into the following categories. The 
microscopic properties are local properties of individual nodes and subgraphs of 
small size. These properties can be further subdivided into those that are de¬ 
fined by the dfc-distributions—the degree distribution, average neighbor degree, 
clustering, Fig. [3] -and those that are not fixed by the dfc-distributions—the con¬ 
centrations of subgraphs of size 3 and 4, Fig. [4} The mesoscopic properties — k- 
coreness and /c-density (the latter is also known as m-coreness or edge multiplic¬ 
ity, Supplementary Note 1), Fig. [5]— depend both on local and global aspects 
of network organization. Finally, the macroscopic properties are truly global 
ones—betweenness, the distribution of hop lengths of shortest paths, and spec¬ 
tral properties, Fig. [6] In Supplementary Note 3 we also report some extremal 
properties, such as the graph diameter (the length of the longest shortest path), 
and Kolmogorov-Smirnov distances between the distributions of all the con¬ 
sidered properties in real networks and their corresponding dfc-random graphs. 
The detailed definitions of all the properties that we consider can be found in 
Supplementary Note 1. 

In most cases—henceforth by “case” we mean a combination of a real net¬ 
work and one of its considered property—we observe a nice convergence of prop- 


12 


erties as d increases. In many cases there is no statistically significant difference 
between the property in the real network and in its 2.5/e-random graphs. In 
that sense these graphs, that is, random graphs whose degree distribution and 
degree-dependent clustering c(k) are as in the original network, capture many 
other important properties of the real network. 

Some properties always converge. This is certainly true for the microscopic 
properties in Fig. [3j simply confirming that our dfc-sampling algorithm oper¬ 
ates correctly. But many properties that are not fixed by the dfc-distributions 
converge as well. Neither the concentration of subgraphs of size 3 nor the dis¬ 
tribution of the number of neighbors common to a pair of nodes are fully fixed 
by dfc-distributions with any d < 3 by definition, yet 2.5fc-random graphs re¬ 
produce them well in all the considered networks. Most subgraphs of size 4 are 
also captured at d = 2.5 in most networks, even though d = 3 would not be 
enough to exactly reproduce the statistics of these subgraphs. We note that the 
improvement in subgraph concentrations at d = 2.5 compared to d = 2.1 is par¬ 
ticularly striking, Fig. [4} The mesoscopic and especially macroscopic properties 
converge more slowly as expected. Nevertheless, quite surprisingly, both meso¬ 
scopic properties (k -coreness and fc-density) and some macroscopic properties 
converge nicely in most cases. The k- coreness, /c-density, and the spectral prop¬ 
erties, for instance, converge at d = 2.5 in all the considered cases other than 
Internet’s Fiedler value. In some cases a property, even global one, converges 
for d lower than 2.5. Betweenness, for example, a global property, converges at 
d = 1 for the Internet and English word network. 

Finally, there are “outlier” networks and properties of poor or no d/c-convergence. 
Many properties of the brain network, for example, exhibit slow or no conver¬ 
gence. We have also experimented with community structure inferred by differ¬ 
ent algorithms, and in most cases the convergence is either slow or non-existent 
as one could expect. 
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Figure 3: Microscopic properties of the PGP network and its dk- 
random graphs. The figure shows (a) the degree distribution P(k), (b) 
average degree k nn (k) of nearest neighbors of nodes of degree k, (c) average 
clustering c(k) of nodes of degree k, (d) the distribution P(m) of the number 
m of common neighbors between all connected pairs of nodes, and (e,f,g) the 
means and (h,i,j) standard deviations of the corresponding distributions. The 
error bars indicate the standard deviations of the metrics across different graph 
realizations. 
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Figure 4: The densities of subgraphs of size 3 and 4 in the PGP net¬ 
work and its dfc-random graphs. The two different graphs of size 3 and six 
different graphs of size 4 are shown on each panel. The numbers on top of pan¬ 
els are the concentrations of the corresponding subgraph in the PGP network, 
while the histogram heights indicate the average absolute difference between the 
subgraph concentration in the d/c-random graphs and its concentration in the 
PGP network. The subgraph concentration is the number of given subgraphs 
divided by the total number of subgraphs of the same size. The error bars are 
the standard deviations across different graph realizations. 
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Figure 6: Macroscopic properties of the PGP network and its dk- 
random graphs. The figure shows (a) the average betweenness b{k) of nodes 
of degree k , (b) the distribution P{1) of hop lengths l of the shortest paths be¬ 
tween all pairs of nodes, the (c,d) means and ( f >g) standard deviations of the 
corresponding distributions, (e) the largest eigenvalues of the adjacency ma¬ 
trix A, and (h) the Fiedler value, which is the spectral gap (the second largest 
eigenvalue) of the graph’s Laplacian matrix L = D — A, where D is the degree 
matrix, D i:t = Sijki, 6,j the Kronecker delta, and ki the degree of node i. The 
error bars indicate the standard deviations of the metrics across different graph 
realizations. 
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Discussion 


In general, we should not expect non-local properties of networks to be exactly 
or even closely reproduced by random graphs with local constraints. The consid¬ 
ered brain network is a good example of that this expectation is quite reasonable. 
The human brain consists of two relatively weakly connected parts, and no dk- 
randomization with low d is expected to reproduce this peculiar global feature, 
which likely has an impact on other global properties. And indeed we observe 
in Supplementary Note 3 that its two global properties, the shortest path dis¬ 
tance and betweenness distributions, differ drastically between the brain and its 
dfc-randomizations. 

Another good example is community structure, which is not robust with 
respect to dfc-randomizations in all the considered networks. In other words, 
dfc-randomizations destroy the original peculiar cluster organization in real net¬ 
works, which is not surprising, as clusters have too many complex non-local 
features such as variable densities of internal links, boundaries, etc., which dk- 
randomizations, even with high d , are expected to affect considerably. 

Surprisingly, what happens for the brain and community structure does not 
appear representative for many other considered combinations of real networks 
and their properties. As a possible explanation, one can think of constraint- 
based modeling as a satisfiability (SAT) problem: find the elements of the ad¬ 
jacency matrix (1/0, True/False) such that all the given constraints in terms 
of the functions of the marginals (degrees) of this matrix are obeyed. We then 
expect that the 3fc-constraints already correspond to an NP-hard SAT problem, 
such as 3-SAT, with hardness coming from the global nature of the constraints 
in the problem. However, many real-world networks evolve based mostly on 
local dynamical rules and thus we would expect them to contain correlations 
with d < 3, that is, below the NP-hard barrier. The primate brain, however, has 
likely evolved through global constraints, as indicated by the dense connectivity 
across all functional areas and the existence of a strong core-periphery structure 
in which the core heavily concentrates on areas within the associative cortex, 
with connections to and from all the primary input and subcortical areas 112 j . 

However, in most cases, the considered networks are dfc-random with d < 2.5, 
that is, d < 2.5 is enough to reproduce not only basic microscopic (local) prop¬ 
erties but also mesoscopic and even macroscopic (global) network properties [B]- 
ITOj . This finding means that these more sophisticated properties are effectively 
random in the considered networks, or more precisely, that the observed values 
of these properties are effective consequences of particular degree distributions 
and, optionally, degree correlations and clustering that the networks have. This 
further implies that attempts to find explanations for these complex but ef¬ 
fectively random properties should probably be abandoned, and redirected to 
explanations of why and how degree distributions, correlations, and cluster¬ 
ing emerge in real networks, for which there already exists a multitude of ap¬ 
proaches [5^1157] . On the other hand, the features that we found non-random 
do require separate explanations, or perhaps a different system of null models. 

We reiterate that the dfc-randomization system makes it clear that there 
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is no a priori preferred null model for network randomization. To tell how 
statistically significant a particular feature is, it is necessary to compare this 
feature in the real network against the same feature in an ensemble of random 
graphs, a null model. But one is free to choose any random graph model. In 
particular, any d defines a random graph ensemble, and we find that many prop¬ 
erties, most notably the frequencies of small subgraphs that define motifs m, 
strongly depend on d for many considered networks. Therefore choosing any 
specific value of d, or more generally, any specific null model to study the sta¬ 
tistical significance of a particular structural network feature, requires some 
non-trivial justification before this feature can be claimed important for any 
network function. 

Yet another implication of our results is that if one looks for network topology 
generators that would veraciously reproduce certain properties of a given real 
network -a task that often comes up in as diverse disciplines as biology j58j and 
computer science |59| —one should first check how dfc-random these properties 
are. If they are d/c-random with low d , then one may not need any sophisti¬ 
cated mission-specific topology generators. The dfc-random graph generation 
algorithms discussed here can be used for that purpose in this case. We note 
that there exists an extension of a subset of these algorithm for networks with 
arbitrary annotations of links and nodes [BD]—directed or colored (multilayer) 
networks, for instance. 

The main caveat of our approach is that we have no proof that our dk- 
random graph generation algorithms for d = 2.1 and d = 2.5 sample graphs 
uniformly at random from the ensemble. The random graph ensembles and 
edge rewiring processes employed here are known to suffer from problems such 
as degeneracy and hysteresis [36J (6T ; , 62J ■ Ideally, we would wish to calculate 
analytically the exact expected value of a given property in an ensemble. This 
is currently possible only for very simple properties in soft ensembles with d = 
0,1, 2 [36] |37|. Some mathematically rigorous results are available for d = 0,1 
and for some exponential random graph models [28] [34] . Many of these results 
rely on graphons [24] that are applicable to dense graphs only, while virtually all 
real networks are sparse [55j. Some rigorous approaches to sparse networks are 
beginning to emerge [631 ML but the rigorous treatment of global properties, 
which tend to be highly non-trivial functions of adjacency matrices, in random 
graph ensembles with d > 2 constraints, appear to be well beyond the reach 
in the near future. Yet if we ever want to fully understand the relationship 
between the structure, function, and dynamics of real networks, this future 
research direction appears to be of a paramount importance. 
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Supplementary Figure 7: Degree distributions in real networks and their 
dfc-randomizations. 
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Supplementary Figure 8: Average nearest neighbor degrees (ANNDs) of 
nodes of a given degree in real networks and their d/c-randomizations. 
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in real networks and their ^-randomizations. 
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Supplementary Figure 10: Subgraph concentration differences betweesn 
d/e-randomizations and real networks. 
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Supplementary Figure 11: Common neighbor distributions in real net¬ 
works and their dfc-randomizations. 
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Supplementary Figure 12: /c-coreness distributions in real networks and 
their d/c-randomizations. 
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Supplementary Figure 13: fc-denseness distributions in real networks and 
their dfc-randomizations. 
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Supplementary Figure 14: Average betweenness of nodes of a given de¬ 
gree in real networks and their dfc-randomizations. 
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Supplementary Figure 15: Shortest path distance distributions in real 
networks and their dfc-randomizations. 
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Supplementary Figure 16: Kolmogorov-Smirnov distances between real 
networks and their dfc-randomizations. 













































































































































































Supplementary Tables 


Supplementary Table 1: Largest eigenvalues, averaged across different 
realizations for each d, and their standard deviations in parentheses. 



Original 

Ok 

lk 

2k 

2.1 k 

2.5 k 

AIR 

48.07 

12.97(0.08) 

42.41(0.24) 

47.46(0.01) 

47.51(0.02) 

47.82(0.03) 

BRAIN 

119.66 

8.91(0.01) 

54.89(0.26) 

113.41(0.02) 

114.09(0.06) 

122.27(0.20) 

WORDS 

109.44 

13.06(0.02) 

104.12(0.28) 

108.82(0.03) 

108.80(0.04) 

108.92(0.02) 

INTERNET 

67.17 

5.36(0.01) 

56.02(0.33) 

61.15(0.03) 

61.32(0.06) 

65.34(0.10) 

PGP 

42.44 

5.77(0.02) 

19.50(0.24) 

34.08(0.03) 

34.40(0.05) 

42.95(0.12) 

PPI 

38.56 

8.05(0.05) 

32.47(0.17) 

34.07(0.04) 

34.05(0.04) 

35.56(0.10) 


Supplementary Table 2: Spectral gaps, averaged across different realiza¬ 
tions for each d, and their standard deviations in parentheses. 



Original 

0A 

IK 

2 K 

2.1A 

2.5 A' 

AIR 

29.34 

6.04(0.09) 

32.61(0.46) 

37.86(0.25) 

37.21(0.21) 

30.93(0.29) 

BRAIN 

40.97 

2.90(0.06) 

35.52(0.31) 

77.53(0.11) 

76.59(0.27) 

42.71(0.35) 

WORDS 

65.31 

5.86(0.02) 

65.28(0.51) 

68.53(0.14) 

68.47(0.12) 

68.21(0.15) 

INTERNET 

17.56 

0.70(0.05) 

14.94(0.53) 

18.83(0.07) 

18.55(0.11) 

19.53(0.25) 

PGP 

4.25 

0.98(0.04) 

5.51(0.31) 

18.01(0.18) 

17.55(0.21) 

4.71(0.19) 

PPI 

11.69 

2.25(0.07) 

15.75(0.27) 

16.44(0.19) 

16.28(0.20) 

10.76(0.17) 


Supplementary Table 3: The considered networks, their abbreviations, 
and the numbers of nodes and links in them. 


Network 

Abbr. 

N 

M 

US air transportation network [66] 

AIR 

500 

2,980 

Brain network pJTl 

BRAIN 

17,455 

67,895 

English word network JJS] 

WORDS 

7,377 

44,205 

Internet AS-level [SH] 

INTERNET 

20,906 

42,994 

PGP web of trust [70] 

PGP 

10,680 

24,316 

Protein interaction network 171] 

PPI 

4,099 

13,355 
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Supplementary Table 4: Parameters used for the dfc-randomization (left) 
and 2.1fc/2.5fc-targeting 2/c-preserving (right) rewiring processes (M 
the number of edges in the real network, c average clustering, c(k) 
average clustering of nodes of degree k). 


Gt 

d/c-randomization 

p-targeting dfc-preserving rewiring 

R 

d 

PGt 

d 

R 

/?o 

ftfactor 

a 

AIR 

100 M 

1/2 

c, c(k) 

2 

10 M 

10" 2 

1.4 

5•10“ 4 

BRAIN 

100 M 

1/2 

c, c(k ) 

2 

10 M 

10" 2 

it 

5•10 -5 

WORDS 

100 M 

1/2 

c, c(k) 

2 

10 M 

10" 2 

1.4 

5•10^ 4 

INTERNET 

100 M 

1/2 

c, c(k ) 

2 

10 M 

10" 2 

1.4 

5•10^ 4 

PGP 

100 M 

1/2 

c, c(k) 

2 

lOOAf 

10" 2 

1.1 

5•10 -7 

PPI 

100 M 

1/2 

c, c(k ) 

2 

200M 

10" 2 

1.1 

5•10 -7 


Supplementary Table 5: dfc-series vs. d-series 


d 

dfc-statistics 

d-statistics 

0 

k 

- 

1 

N(k) 

N 

2 

N{k, k') 

M 

3 

N A (k,k',k") 

N A {k,k',k") 

W 

T 
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Supplementary Notes 

Supplementary Note 1: Network properties 

Here we describe all the network properties measured and discussed in Supple¬ 
mentary Note 3 and, where meaningful, their relations to dfc-series. 


1.1 Degree distribution 

The distribution P{k) of node degrees k, i.e., the lfc-distribution, is: 


m 


N{k) 

N 


( 1 ) 


where N(k) is the number of nodes of degree k in the network, and N is the 
total number of nodes in it, so that P{k) is normalized, J2 k P{k) = 1. The 
1/c-distribution fully defines the Ofc-distribution, i.e., the average degree k in the 
network, by 

ic = Y , kp ( k ), ( 2 ) 

k 


but not vice versa. 


1.2 Average nearest neighbor degree (ANND) 

The average degree k nn {k) of nearest neighbors of nodes of degree k is a com¬ 
monly used projection of the joint degree distribution (JDD) P(k,k'), i.e., the 
2/c-distribution. The JDD is defined as 

P(fc, k') = /i(fc, k') ~ ’ ( 3 ) 

where N(k,k') = N(k',k) is the number of links between nodes of degrees k 
and k' in the network, M is the total number of links in it, and 


M&, k ') 


2 if k = k\ 

1 otherwise, 


(4) 


so that P(k,k') is normalized, P(k, k') = 1. The 2fc-distribution fully 

defines the lfc-distribution by 

m = ( 5 ) 

k' 


but not vice versa. The average neighbor degree k nn {k) is a projection of the 
2/c-distribution P(k. k') via 


k nn( k ) 


k 

kP(k) 


J2 k ' p ( k ’ k ') 


Em k'PW) 

E k> p ( k ’ k ') 


( 6 ) 
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1.3 Clustering 


Clustering of node i is the number of triangles A * it belongs to, or equivalently 
the number of links among its neighbors, divided by the maximum such number, 
which is k{k — l)/2, where k is i’s degree, deg(i) = k. The average clustering 
coefficient of the network is 


1 A,; 

N ^ ki(ki - l)/2 


(7) 


Averaging over all nodes of degree k, the degree-dependent clustering is 


c{k) 


2A (k) 

k{k - 1 )N(k) 


, where A (k) 


E 

i: deg(i)=/c 


( 8 ) 


The degree-dependent clustering is a commonly used projection of the 3k- 
distribution. (See [721 l73| for an alternative formalism involving three point 
correlations.) The 3fc-distribution is actually two distributions characterizing 
the concentrations of the two non-isomorphic degree-labeled subgraphs of size 3, 
wedges and triangles: 

k 


k' k" k' k" 

Let N A (k',k,k") = N A (k",k,k') be the number wedges involving nodes of de¬ 
grees k, k', and k ", where k is the central node degree, and let N A {k,k’ ,k") 
be the number of triangles consisting of nodes of degrees k, k !, and k ", where 
N A (k,k' ,k") is assumed to be symmetric with respect to all permutations of 
its arguments. Then the two components of the 3/T-distribution are 


p A {k’,k,k") 

= 

(9) 

p A (k,k',k") 

= v{k,k',k") NA{k ^ k "\ 

6i 

(10) 


where W and T are the total numbers of wedges and triangles in the network, 
and 

f 6 if k = k! = k ", 

v{k , k f , k") = < 1 if k ^ k!k", (11) 

[2 otherwise, 

so that both P A (k ', k, k") and Pa (A:, k !, k") are normalized, J2 k k' k" Pd^ , k , k") 
k" Pa (A, k 1 , k") = 1. The 3fc-distribution defines the 2/c-distribution (but 
not vice versa), by 

k" v 

+ ^[p A (k',k,k")+p A (k,k',k")]y ( 12 ) 
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The normalization of 2k- and 3fc-distributions implies the following identity 
between the numbers of triangles, wedges, edges, nodes, and the second moment 
of the degree distribution k 2 = k 2 P(k ): 


0 3 T+W + M 

""" ~lsr 


= k 2 . 


(13) 


The degree-dependent clustering coefhcient c(k) is the following projection of 
the 3fc-distribution 


_ 6TEk',k" P Mk,k',k") 
~ N k(k - l)P(fc) 


(14) 


1.4 Subgraph frequencies 


The concentration of subgraphs of size 3 is exactly fixed only by the 3fc-distribution, 
or by the 3-distribution, Supplementary Note 4. There are two non-isomorphic 
connected graphs of size 3 (triangles and wedges), and their concentrations are 
defined as 


c = A 

A AT, 



(15) 


where A is the number of wedges in the graph, A is the number of triangles in 
the graph, and N :i = A + A is the total number of connected subgraphs of size 
3 in the graph. 

The concentration of subgraphs of size 4 is exactly fixed only by the 4 k- 
distribution, or by the 4-distribution. There are six non-isomorphic connected 
graphs of size 4, 



and their concentrations are defined as the number of subgraphs of a particular 
type divided by the total number of connected subgraphs of size 4. 

In our comparisons of real networks and their dfc-raridornizations in Sup¬ 
plementary Note 3 we choose to compare the subgraph concentrations directly, 
versus computing z-scores, as common in the motif literature. The reasons for 
this decision is that z-scores are tailored for a fixed null model, while we are 
considered a series of null models parameterized by d in dk- series. There is 
nothing in the z-score and dfc-series definitions that could easily provide any 
estimates of how fast the subgraph frequency means and standard deviations in 
the z-score definition converge as functions of d. Therefore the comparisons of 
z-scores for different values of d would be meaningless. 


1.5 Common neighbors 

The number rn t j of common neighbors between two connected nodes i and j is 
the number of nodes to which both i and j are connected, or equivalently the 
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multiplicity of edge (i,j): 



(16) 


where {A^} is the adjacency matrix of the graph. The distribution P(to) of 
the number of common neighbors m is then 



(17) 


where <5 is the Kronecker delta. The common neighbor distribution is thus the 
probability that two connected nodes in the graph have m common neighbors. 
This property is exactly fixed only by the 3fc-distribution. 

1.6 fc-coreness and fc-denseness 

The fc-core decomposition m of a graph is a set of nested subgraphs induced 
by nodes of the same fc-coreness. A node has fc-coreness equal to fc if it belongs 
to a maximal connected subgraph of the original graph, in which all nodes have 
degree at least k , i.e., in which each node is connected to at least k other nodes 
in the subgraph. 

Similarly, the fc-dense decomposition eg of a graph is a set of nested sub¬ 
graphs induced by edges of the same fc-denseness. An edge has fc-denseness 
equal to k if it belongs to a maximal connected subgraph of the original graph, 
in which all edges have multiplicity [71 El [75] at least k— 1, i.e., in which each 
pair of connected nodes has at least fc — 1 common neighbors in the subgraph. 

Both the fc-core and fc-dense decompositions rely on the analysis of local 
properties of nodes and edges. However, due to the recursive nature of these 
decompositions, the dfc-distributions with d = 0,1, 2, 2.1, 2.5 do not exactly fix 
either the fc-core or fc-dense distributions. 

1.7 Betweenness 

Betweenness b(i) of node * is a measure of how “important” i is in terms of the 
number of shortest paths passing through it. Formally, if <r st (i ) is the number of 
shortest paths between nodes s ^ i and 1/* that pass through *, and a s t is the 
total number of shortest paths between the two nodes s / 1, then betweenness 
of i is 



(18) 


Averaging over all nodes of degree fc, degree-dependent betweenness b(k) is 



(19) 
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1.8 Shortest path distance 


The distance distribution is the distribution of hop-lengths of shortest path 
between nodes in a network. Formally, if N(h) is the number of node pairs 
located at hop distance h from each other, then the distance distribution P(h) 
is 


P(h) 


N(h) 

N(N — l)/2 ’ 


( 20 ) 


where N(N — l)/2 is the total number of nodes pairs in the network. The 
average distance is: 

h = Y J P{h). ( 21 ) 

h 


Finally, the network diameter, i.e., the maximum hop distance between nodes 
in the network, is 


d = max(ft). 


( 22 ) 


1.9 Spectral properties 

The adjacency matrix of graph A gives the full information on the structure of 
the graph. The largest eigenvalue of A and the spectral gap, which is defined 
as the difference between the largest and second largest eigenvalue A , play im¬ 
portant roles in the dynamic processes on networks. For instance, the largest 
eigenvalue of the adjacency matrix is related to the speed of the spreading pro¬ 
cesses on the network [771 Eg, while the gap determines the speed of convergence 
of the random walk to its steady state m- 

The Laplacian matrix describes the diffusion of a random walker on the net¬ 
work and is defined as L = D — A, where D is the diagonal matrix of degrees 
Dij = Sijki, Sij is Kronecker delta and ki is the degree of node i. The smallest 
eigenvalue of the Laplacian matrix is associated to stationary distribution of 
random walker and it is always equal to zero, while the smallest non-zero eigen¬ 
value, Fiedler value, defines the time scale of the slowest mode of the diffusion 

m- 

Supplementary Note 2: Considered networks 

We apply the dfc-series analysis to the following six social, biological, language, 
communication, and transportation networks, Table [3j 

• AIR. The US air transportation network S6B'. The nodes are airports, and 
there is a link between two airports if there is a direct flight between them. 

• BRAIN. The largest connected component of an fMRI map of the hu¬ 
man brain m- The nodes are voxels (small areas of a resting brain of 
approximately 36mm 3 volume each), and two voxels are connected if the 
correlation coefficient of the fMRI activity of the voxels exceed 0.7. 
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• WORDS. The largest connected component of the network of adjacent 
words in Charles Darwin’s “The Origin of Species” [65]. The nodes are 
words, and two words are connected if they are adjacent in the text. 

• INTERNET. The topology of the Internet at the level of Autonomous 
Systems (ASes) [65]. The nodes are ASs (organizations owing parts of the 
Internet infrastructure), and there is a link between two ASs if they have 
a business relationship to exchange Internet traffic. 

• PGP ( considered in the main text ). The largest strongly connected com¬ 
ponent of the technosocial web of trust relationships among people ex¬ 
tracted from the Pretty Good Privacy (PGP) data 170]. The nodes are 
PGP certificates of users, and there is a link between two certificates if 
their users mutually trust each other’s certificate/user associations. 

• PPI. The largest connected component of the human protein interaction 
network m- The nodes are proteins, and there is a link between two 
proteins if they interact. 

Table[4]reports the parameters used for each network in the dfc-randomization 
and p-targeting dfc-preserving rewiring processes. 

Supplementary Note 3: Results 

Degree distribution. We observe in Fig.[7]that while Ofc-randomizations are 
way off, the dfc-random graphs with d > 1 reproduce the degree distributions 
in the real networks exactly, which is by definition: the lfc-distribution is the 
degree distribution, and dfc-random graphs with d > 1 have exactly the same 
degree distributions as the real networks. 

Average nearest neighbor degree (ANND). We observe in Fig. [8] that 
while Ofc-randomizations are way off, the lfc-random graphs tend to be closer 
to the real networks in terms of ANND, whereas the dfc-random graphs with 
d > 2 have exactly the same average neighbor degrees as the real networks, 
which is again by definition: the dfc-random graphs with d > 2 have exactly 
the same JDD P(fc,fc') as the real networks. In the WORDS, INTERNET, and 
PPI cases, the ANNDs k nn (k) even in the lfc-random graphs do not noticeably 
differ from the ANNDs in the real networks. 

Clustering. We observe in Fig.[9]that degree-dependent average clustering in 
the 2.5fc-random graphs matches the one in the real networks, which is again 
by definition. For d < 2.5, degree-dependent clustering differs sensibly in many 
cases. However, degree-dependent clustering in the AIR network does not ex¬ 
hibit noticeable differences with its 2.1fc-randomizations, while in the WORDS 
case, even the lfc-random graphs reproduce degree-depended clustering nearly 
exactly. 
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Subgraph frequencies. We observe in Fig. 10 that the 2/e-random graphs 
reproduce the subgraphs frequencies in most cases, but the BRAIN and PGP 
require d = 2.5 to reproduce these frequencies. 


Common neighbors. We observe in Fig. El that the lfc-random graphs re¬ 
produce the common neighbor distributions in all the cases except the BRAIN, 
which requires d = 2, and PGP, which requires d = 2.5. 


fc-coroncss and fc-denseness. We observe in Fig. [12] that the 2fc-random 
graphs reproduce the fc-coreness distributions in all the networks except the 


PGP and BRAIN that require d = 2.5. We observe in Fig. 1_3 that the 2.5 k- 
randorn graphs reproduce the fc-denseness distributions in all the networks. The 
/c-denseness distributions in the AIR and WORDS networks are reproduced even 
by their 2/c-random graphs. 


Betweenness. We observe in Fig. 14 that betweenness in the BRAIN network 
cannot be approximated even by its 2.5fc-random graphs. The INTERNET lies 
at the other extreme: even the lfc-random graphs reproduce its betweenness. 
The PGP network requires all the constraints imposed by the 2.5fc-distribution, 
while betweenness in all the other networks is similar to betweenness in their 
2/c-random graphs. 


Shortest path distance. We observe in Fig. 15 that the distance distribu¬ 
tions in the INTERNET and WORDS networks are correctly reproduced by 
their lfc-random graphs. Even d = 2.5 is not enough for the BRAIN, while the 
same value of d = 2.5 suffices for all the networks. 


Spectral properties. We observe in Table [T] that the largest eigenvalue of 
the adjacency matrix is closely, although not exactly, reproduced by d = 2.5 k- 
randorn graphs for all six networks. Furthermore, we observe that the largest 
eigenvalues for 2fc-random graphs of AIR and WORDS networks are very close 
to the eigenvalues of the original networks. 

The values of the spectral gaps for 2.5fc-random graphs shown in Table [2] are 
relatively close to the values observed for the original networks, with relative 
difference for AIR, BRAIN and WORDS networks around 5%. The large values 
of the spectral gaps for 2k and 2.1fc-randonr graphs indicate that they are more 
robust, in the sense of being better connected and interlinked, compared to the 
original networks. 

Kolmogorov-Smirnov distance. In Fig. [l6]we quantify the convergence of 
d/c-series in terms of Kolmogorov-Smirnov (KS) distances between the distribu¬ 
tions of per-node values of a given property in the real networks and the same 
distributions in their dfc-random graphs. We report the KS distances for the 
following properties: 
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k degree, cf. Fig.[7J 
knn ANND, cf. Fig. § 
c clustering, cf. Fig. [9] 

comm.neigh common neighbors, cf. Fig. [Tl] 

kcore /e-coreness, cf. Fig. |T2| 

kdense /..--density, cf. Fig. [I3| 

bet betweenness, cf. Fig. [14} 

path-len shortest path distance, cf. Fig. [I5| 

The Kolmogorov-Smirnov distance between two cumulative distribution func¬ 
tions (CDFs) Fi(x) and F 2 (x) is 

D = sup\F 1 (x) - F 2 (x)\. (23) 

X 

In our case, F\ (x) is the per-node CDF of a given property in a real network, and 
F 2 (x) is the per-node CDF for the same property computed across all different 
dfc-random graph realizations for the network with a given d. We note that the 
KS distances provides more detailed statistics than the dfc-distributions, because 
the latter do not differentiate between nodes of the same degree, while the former 
do. For example, even if the 2/c-distributions and consequently ANNDs k nn (k) 
in two different networks are exactly the same, the distributions of average 
degrees ki^nn of neighbors of each individual node i, i = 1,..., N, are in general 
different, so that the KS distance between the two per-node ANND CDFs is in 
general greater than zero. 


Supplementary Discussion 

We compare dk -series with the series based on subgraph frequencies, and show 
that the latter cannot form a systematic basis for topology analysis. 

The difference between dk -series and subgraph-based-series, which we can 
call d-series, is that the former is the series of distributions of d-sized subgraphs 
labeled with node degrees in a given network, while the d-series is the distri¬ 
butions of such subgraphs in which this degree information is ignored. This 
difference explains the mnemonic names for these two series: ‘d’ in l dk' refers to 
the subgraph size, while ‘fc’ signifies that they are labeled by node degrees— l k’ 
is a standard notation for node degrees. 

This difference between the dfc-series and d-series is crucial. The dk -series are 
inclusive, in the sense that the (d+l)fc-distribution contains the full information 
about the dfc-distribution, plus some additional information, which is not true 
for d-series. 
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To see this, let us consider the first few elements of both series in Table[5j In 
Supplementary Note 1 we show explicitly how the (d + l)fc-distributions define 
the dfc-distribution for d = 0,1, 2. The key observation is that the d-series does 
not have this property. The O’th element of d-series is undefined. For d = 1 we 
have the number of subgraphs of size 1, which is just N, the number of nodes 
in the network. For d = 2, the corresponding statistics is M, the number of 
links, subgraphs of size 2. Clearly, M and N are independent statistics, and the 
former does not define the latter. For d = 3, the statistics are W and T, the 
total number of wedges and triangles, subgraphs of size 3, in the network. These 
do not define the previous element M either. Indeed, consider the following two 
networks of size N —the chain and the star: 

1 


1 

There are no triangles in either network, T = 0. In the chain network, the 
number of wedges is IF = N — 2, and in the star IF = ( N — 1 )(N — 2)/2. We 
see that even though W (d = 3) scales completely differently with N in the two 
networks, the number of edges M = N — 1 (d = 2) is the same. 

In summary, d-series is not inclusive. For each d, the corresponding element 
of the series reflects a differen kind of statistical information about the network 
topology, unrelated or only loosely related to the information conveyed by the 
preceding elements. At the same time, similar to dfc-series, the d-series is also 
converging since at d = N it specifies the whole network topology. However, 
this convergence is much slower that in the dfc-series case. In the two networks 
considered above, for example, neither W = N — 2, T = 0 nor W = ( N — 
1)(N — 2)/2, T = 0, fix the network topology as there are many non-isomorphic 
graphs with the same (IF, T ) counts, whereas the 3fc-distributions IV A (1, 2, 2) = 
2, N a ( 2,2,2) = N-4 and iV A (l, N- 1,1) = (N - 1)(N - 2)/2 define the chain 
and star topologies exactly. 

The node degrees thus provide necessary information about subgraph lo¬ 
cations in the original network, which significantly speeds up convergence as 
a function of d, and more importantly makes the dk- series basis inclusive and 
systematic. 



Supplementary Methods 

The methods that we use to sample dfc-random graphs for a given graph rep¬ 
resenting a real network are based on two different rewiring processes: dk- 
randomizing rewiring (d = 0,1,2) and p-targeting d/c-preserving rewiring (p = 
2.1k,2.5k). 

The first method (d/c-randomization) consists of swapping random pairs of 
edges in the original network preserving its dfc-distribution, Algorithm [T] The 
following three input parameters are required: Gt the original graph, R the 
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number of rewirings to apply, and d index that indicates the dfc-distribution 
to preserve. The random edge selection function on line 4 and the rewiring 
function on line 5 depend on d as follows: 

• if d = 0, random edge (i,j) and non-edge (o, 6) (disconnected nodes a 
and b) are selected, and the rewiring consists of removing edge (i,j) and 
adding edge (a, b ). 

• if d = 1, two random edges (i,j) and (a, b) are selected and discarded if 
either edge ( i,b ) or edge (j, a) exists; if neither edge ( i,b ) nor edge (j, a) 
exists, the rewiring consists of removing edges (/, j) and (a, 6), and adding 
edges (i,b) and (j,a). 

• if d = 2, two random edges ( i,j ) and (a, b) such that degrees ki = k a 
are selected and discarded if either edge (*,&) or edge (j, a) exists; if nei¬ 
ther edge ( i , b) nor edge (j, a) exists, the associated rewiring consists of 
removing edges (*, j) and (a, 6) and adding edges (i,6) and (j, a). 


Supplementary Algorithm 1: dfc-randomization process. 


Input: Gt\ 

Input: R ; 

Input: d; 

/* 1. Initialize process variables 

1 i = 0 ; 

2 Go = G r ; 

/* 2. Apply R dk- rewirings 

3 while i < R do 

/* Select a random pair of edges (see the text) 

4 rew = random(edges 6 GJ; 

/* Apply the rewiring to Gt 

5 apply_rewiring(Gi, rew); 

e i = i + 1; 

Output: Gi\ 


II Original graph 
// Humber of rewirings 
// d value: 0, 1, or 2 
*1 

II Rewiring counter 
II Graph to rewire 
*/ 

*/ 

*/ 


// dfc-randomized graph 


The second method of (p-targeting ^/.-preserving rewiring) is based on 
simulated annealing, and consists of two phases: randomization and targeting 
rewiring, Algorithm [2] The following input parameters are required: Gt the 
original graph, pc T the property to target, R the number of dfc-rewirings to 
apply at each value of temperature, /3o the initial inverse temperature, f3factor 
the rate of temperature decrease, and a the acceptance threshold. In the first 
phase the original graph is 2fc-randomized by Algorithm [l] In the second phase, 
the obtained 2fc-random graph is 2/c-rewired, but each rewiring is accepted with 
probability min[exp(— f3H), 1] which depends on current values of energy H and 
temperature 1//3. Energy is defined as the distance between the values of prop¬ 
erty p in the original and current rewired graphs. Temperature is high initially, 
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but each round of R rewirings (line 9), it decreases by factor /3/acfon thus de¬ 
creasing the probability of accepting a rewiring that increases energy. This 
second phase terminates when either energy is zero, meaning that the value of 
p-property in the rewired graph p G . is equal to its value in the original graph 
Pg t i or when the percentage of accepted rewirings during the last round falls 
below a user-specified threshold a. Function compute_property(G) appear¬ 
ing on lines 3 and 12 returns average clustering c or average degree-dependent 
clustering c(fc) of G depending on whether d = 2.1 or d = 2.5, respectively. 
Energy function distance (pGi,PG T ^ appearing on lines 4and 13 depends on d 
as follows: 

•if d= 2.1, distance (p G .,p GT ) = |c Gi - c G t |, 

•if d= 2.5, distance (p G .,p GT ) = J^k I CGi(k) -c Gr (/c) |. 

Code availability We release the software package that implements the dk- 
randomization algorithms described above. The code is freely available at http: 
//polcolomer.github.io/RandNetGen/ [ST]. 


43 


Supplementary Algorithm 2: ^-targeting d/c-preserving rewiring pro¬ 
cess. 


// Original graph 
// Number of rewirings 
// Target value of the property 
// Initial inverse temperature 
// Temperature decrease rate 
// Acceptance threshold 
*/ 


Input: Gt', 

Input: R; 

Input: p Gr ; 

Input: fa; 

Input: Pfactor\ 

Input: a; 

/* 1. 2/c-randomize the original graph using Alg. 2 

1 M = num.edges(G:r); 

2 Go = dfc-randomize(G t. R=100M, d = 2); 

/* 2. Compute the current value of the property */ 

3 pg 0 = compute property (G 0 ) ; 

/* 3. Compute the current energy, the distance between pq q and Pg t */ 

4 H 0 = distance (p Go ,p Gr ) ; 

/* 4. Initialize process variables */ 

5 i = 0; // Accepted rewirings counter 

6 Step = 0; // Counter associated with current temperature 

7 0step ~~ 00 ) 


8 A, 


step 


= 1 ; 


// Acceptance ratio, percentage of accepted rewirings that 


modified energy at current temperature 

/* 5. 2/c-rewire until energy goes to zero or the acceptance ratio falls 
below the threshold 


*/ 


9 while Hi > 0 and A 3tep > a do 

/* Apply R p-targeting 2/c-rewirings at constant temperature 1 /(3 s tcp 


10 

11 

12 

13 


for r «— 1 to R do 

/* Apply a single 2/c-rewiring to the current graph, using Alg. 2 

Ghyp - <2fc-randomize(Gi, 1, 2); 

Phyp = compute_property (.G hyp ) ; 

/* Compute current energy 
Hhyp~ distance (phypi PGt ^» 


14 


/* Compute energy change 

A H = Hhyp ~ Hi; 


15 

16 

17 

18 
19 


/* Accept graph Gh yp with the following probability 

with probability min(e“^ step A ^, 1) do: 

r = r - 1-1; 

i = i + 1; 

— Ghypi 
_ Hi — Hhyp] 


*/ 

*/ 


*/ 

*1 

*/ 


20 

21 

22 


0step-\-l = 0step * 0factory 

a _ #rewirings accepted with Ai/^0 . 

step+1 #rewirings with Aff^O 1 

step = step + 1; 


Output: Gi 


// Update f3 
II Update the acceptance ratio 
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